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mTRODUCTiaS 

In  the  past,  the  interpretation  of  multivariate  systems  has  been  dealt 
with  in  several  ways  from  a  working  methodological  point  of  view.  First  is 
the  inspection  of  arrays  of  siaiple  correlation  coefficients  either  at  random 
or  in  some  systematic  fashion.  Then,  from  the  magnitude  of  these  simple 
correlation  coefficients  and  knowledge  concerning  the  variables  involved, 
the  experimenter  or  statistician  will  try  to  extract  what  information  Is 
needed,  perhaps  by  use  of  mailtiple  regression  or  Just  from  the  correlations 
themselves . 

Secondly,  the  interpretation  of  the  multivariate  system  may  be  such  that 
prediction,  classification,  or  the  regress icm  of  one  variable  on  another  is 
the  goal  of  toe  interpretation.  In  this  case,  the  methods  involved  fulfill 
their  aim  qjiite  successfully. 

Since  in  many  analyses  of  multivariate  systems  the  inteipretation  of  the 
system  is  the  infoimation  sought  because  it  is  the  answer  to  some  of  the 
questions  concerning  the  system,  the  author  has  considered  the  following 
question:  Wl^  is  multiple  regression  analysis  and  inspection  of  arrays  of 
siniple  correlation  coefficients  not  adequate  for  the  experimenter  in  applied 
areas  vho  has  much  knowledge  in  his  own  area  concerning  a  particular  system? 


TPhis  knowledge,  which  aaay  be  of  the  form  of  experience  in  working  with 
varioiis  variables  or  of  some  other  form,  can  be  used  to  advantage  and  will 
be  one  basic  assumption  underlying  the  methods  which  follow. 


Vv'i- 


a 


Oae  possible  answer  might  be  that  he  has  sufficient  knovledge  of  a  system  in 
his  ovn  field  to  make  any  information  vMch  the  multiple  regression  or  simple 
correlations  may  provicle  just  verification  of  vhat  he  already  suspects.  This, 
in  some  cases,  may  be  his  goal;  but  in  other  inataoces,  may  be  vasted  time 
and  effort  for  he  may  also  be  interested  in  obtaining  some  additional  infor- 
mation frcm.  the  data  \rtiich  may  be  disguised  by  multivai^iate  interaction. 
Pursuing  this  line  of  thoTight,  it  might  se«n  reasonable  to  ask  one  more 
question.  Can  any  additional  information  be  gained  from  6ing)le  conflation 
coefficients  in  a  multivariate  system  if  knowledge  and  an  aesungption  toward 
the  system  is  taken  and  felt  to  be  true  by  the  experimenter?  A  partial 
answer  to  this  question  might  be  given  if  one  were  to  think  in  terms  of 
systems  of  correlation  coefficients  associated  with  a  multivariate  system. 
Wright  (195^)  has  considered  such  an  approach. 

Carrying  the  idea  of  a  system  of  correlation  coefficients  farther,  it 
oMy  be  informative  to  visualize  sucsh  a  system  from  two  angles .  First,  one 
my  consider  the  usual  descriptive  statistical  side  (Wright  195^).  On  the 
other  hand,  there  i»  the  interpretive  appix)ach  in  ^Ailch  one  considers  causal 
relations.  This  method  of  approach  leads  to  an  inteipretable  solution  to 
the  problem* 

Pirst,  to  consider  what  has  already  been  done  along  the  line  of  inter- 
pretation of  a  multivariate  system  and  what  fruitful  ideas  can  be  obtained 
from  the  literature,  an  investigation  and  short  svirvey  of  path  analysis  and 
the  method  of  path  coefficients  is  warranted.  The  original  work  in  this 
area  was  published  by  Sewall  Wright  in  I918  and  1921.  Subsequently,  it  has 


"beei  elaborated  and  re-explained  by  the  same  teclmiqvies  in  other  vorisa  vhich 
appeared  in  193^*^,  195^,  1960a,  and  1960b.  Also,  to  a  very  limited  extent, 
others,  nanely,  Tukey  (195^);  U  (1956);  and  Turner  (1959,  19^1),  have  pub- 
lished \riiali  seem  to  be  variations  and  re  interpretations  of  the  saoie  techniqjieB. 
Several  applications  of  the  techniques  have  been  published  by  Lu  (1959>  19^2) . 

It  seems  that  no  one  has  really  presented  the  interpretaticai  of  a  system 
of  correlation  coefficients  from  the  viewpoint  exclusively  of  an  analysis  of 
correlation  vith  a  practical  solution  provided  by  a  computer  program.  There- 
fore, it  is  the  intent  of  the  author  in  this  paper  to  develop,  in  matjrlx  form, 
the  technioues  for  such  an  analysis  of  correlation  in  vhich  the  underlying 
raultivaxiate  system  is  symmetric  and  closed,  and  to  present  a  practical  means 
for  \\f>r\t^^^nQ  M^-order  multivariate  systems. 


fESEOH 


Consider  a  symmetric  syst^  of  eo«?i6latlon  coefficients  denoted  by 


(1) 


Tihere  s  is  the  order  of  the  system  and  r .  .  are  simple  correlation  coefficients 


with  r^ ,   <  1  if  1  "/  j  and  r .  »  1  if  i  »  j .  Further  assume  that  this 
system  is  associated  with  a  determinately  closed  syimnetric  multivariate  system 
in  which  the  underlying  model  has  a  defined  point  of  view.  By  a  determinately 
closed  system  is  meant  a  system  of  correlation  coefficients  in  which  no  fuarther 
variation  can  be  explained  by  the  addition  of  more  factors  to  the  associated 
multivariate  system. 

It  has  been  shown  by  Wri^t  (l960a)  that  the  system  in  its  matrix  repre^ 
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and  m  is  the  effect  factor  of  the  system.  Note  also  from  (l)  that  s  »  n  ♦  1. 
In  considering  an  analysis  of  correlation  of  this  system,  it  will  be  neces- 
sary to  woi^c  with  each  conrponent  of  R. 

At  this  point,  two  terms  must  be  defined,  namely,  direct  effects  and 
cong>ound  effects.  By  &  direct  ^££%Q%  shall  b«  meant  the  direct  Influence 
of  the  ith  factor  to  the  mth  factor  of  the  t.     correlation  in  the  system, 
denoted  by  p.  .  A  compound  effect  is  the  indirect  influence  of  the  other 
factor(s)  exclusive  of  the  ith  factor  of  the  r.  correlation  in  the  syat«m» 
It  should  be  noted  here  that  the  column  vector  P  is,  in  terms  of  Wright's 
(1934,  1960a)  work,  known  as  a  vector  of  path  coefficients  of  the  deter- 
minately  closed  system.  Further,  Wright  (193**')  has  shown  that  the  p^ 
measure  the  directional  contribution  of  the  ith  factor  to  the  mth  factor, 
since  the  p^  have  both  magnitude  and  direction  and  are  independent  of 
physical  units,  because  they  are  standardized  partial  regression  coef- 
ficients. Thus,  by  pooling  the  other  additive  factors,  which  contribute 


to  r.  ,  exclusivt  of  the  p^,  a  measure  of  the  indirect  ccoitxlbution  to 


r.  Is  obtained.  By  definition,  the  conipound  effect  is  an  Indirect  contri- 
bution. 

Since  a  ayionetric  system  of  correlation  coefficients  is  to  be  considered 
and  an  investigation  of  the  correlation  coefficients  between  the  causal 
factors  and  the  effect  factor  is  desired,  it  will  be  advantageous  to  consider 
»  diagram  for  each  system,  which  will  represent  the  detemiinately  closed 
system.  Thus,  for  lack  of  better  means,  the  convention  of  Wright  (193^,  1960a) 
is  followed  in  the  use  of  unidirectional  and  bidirectional  arrows  supplemented 
by  appropriate  correlation  and  path  coefficients  as  representative  of  (l) . 
Following  this  p3X)cedure,  consider  figure  !• 
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Figure  1.  A  detexttlnately  closed  symmetric  multivariate  system.  S^»  of  the 
features  of  the  preceding  diagxaa  are:  m  represents  the  effect  factor  and 
1>  2,  3,  ...,  n  twpresent  the  causal  factors.  The  q.  is  the  residual  factor 
and  is  assumed  to  be  uncoirelated  vith  the  variables  1,  ...,  n  in  the  closed 
system.  Also,  note  that  the  p^^  (i  =  1,  ...,  n)  are  represented  by  directional 
arrows  eaqpressing  the  directional  influence  of  each  p^.  The  r^^,  arc  repre- 
sented by  bidirectional  arrows  indicating  their  associative  relationship. 

For  purposes  of  convenieo<»jj  the  square  synnaetric  matrix  A  is  partitioned 
Into  n  row  vectors  in  the  f olloirlng  maimer: 
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How  it  can  be  seen  that  each  A.P  =  R^  (i  «  1,  ...,  n),  where  the  R^  are  de- 
fined by  f  „,  ^' 
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is  an  equation  \rtiich  can  be  partitioned  into  a  direct  aad  goi^imd  effect. 

Thus        .  .,,:•(.. 

B.  ■  (Directional  effect  of  the  ith  factor)  +  (Corapound  effect 
of  the  other  factors). 


For  example, 

n 

n 

Xa  flneral;  let  us  represent  R^  by 

Sow,  for  purposes  of  clarity,  consider  the  followiiag  analysis  of  correlation 

table: 

Table  1  -  Analysis  of  Correlation 

Effect  Contribution 


n 

Total  correlation  R. 

i 

It  is  easily  seen  that  table  1  displays,  in  clear  tenas,  that  part  of  the 
total  correlation  vhich  is  due  to  the  ith  factor  directly  and  also  that  part 
■iriaich  is  indirectly  responsible  for  the  rest  of  the  total  correlation.  In  a 
system  of  order  b,   there  will  be  s  correlation  coefficients  which  may  be 
cmalyzed.  These  are  represented  by  the  vector  R.  Further  information  can 
be  obtained  if  one  wishes  to  partition  the  compound  effect  into  individual 
Indirect  effects  of  the  s  -  1  other  factors  of  the  system  excliisive  of  the 
ith  factor.  To  see  this,  consider  table  2. 
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Table  2  -  Aiialysls  of  Correlation 


Effect 

Contribution 

Direct 

Pim 

Compound 

i  -i.p.j^^i) 

Due  to  1 

Due  to  2 

• 

Due 'to  n 

^11^1  m 
^i2P2m 

• 

^i^nm 

Sotal  correlation   R. 

One  can  easily  sec  the  advantage  of  table  2  since  each  factor,  exclusive  of 
the  ith,  appears  with  a  magnitude  of  the  contributing  or  detracting  effect, 
whichever  it  may  be,  it  has  on  the  total  correlation. 

A  determinately  closed  systen  has  been  under  consideration,  but  there 
has  been  no  expression  of  the  detenaination  of  the  closed  si'stem  by  a  quanti- 
tative measure.  It  is  apparent  that  the  roeasure  should  be  of  such  nature 
that  it  will  be  in  the  range  from  zero  to  one.  Thus,  by  multiplying  both 
sides  of  eqiiaticaa  (2)  on  the  left  by  P',  one  has 

P'  (AP)  =-  P'R,  '  (^) 

and  by  the  law  of  associativity  of  matrix  multiplication 

(P'A)  P  •  P'R.  (5) 

Letting 

f 'A  -  B  end  P'B  -  S 
the  equation  beconea         ?■  • 

BP  -  S.  (6) 


It  iaay  not  be  obvious,  but  upon  investigation,  it  can  be  seen  that  S  is  of 
dimension  1x1  since  Bj^  ^^  ^P^  ^^  ^  =  S^  ^  ^}   thus,  it  is  a  scalar  and  i» 
identical  to  the  coefficient  of  determination,  or  the  squared  multiple  corre- 
lation coefficient  as  it  is  Imovri  in  multiple  regression  theory.  Now,  for 
the  system  in  (2),  there  is  e  measure,  in  the  appropriate  range,  which  is 
representative  of  the  amount  of  variation  in  m  which  is  accounted  for  by 
the  factors  1,  .*.,  n  in  the  closed  system.  In  addition,  it  is  known  that 

S  +  (l  -  S)  »  1  (7) 

«»"■   ■ 

s  +Q  =  1  ' '  '   '..-^    (8) 

where  Q  «  1  -  S.  Thei^ joi«,  if  the  unescplained  variation  in  the  system  is 
of  interest,  it  is  seen  that  Q  is  such  a  mesisure.  Thus,  it  will  be  convenient 
to  define  Q  as  the  coefficient  of  Indetermination  of  the  system.  It  is  inter- 
esting to  observe  that  if  Q  =»  0,  then  S  ■  1,  and  the  determinately  closed 
system  becomes  a  completely  closed  system.  If  S  is  something  less  than  one, 
then  the  degree  to  irtiich  the  system  is  closed  is  S  x  100^.  It  is  noteworthy 
also  to  observe  that  the  coefficient  of  dstermination  S  is  the  sum  of  n  tena 
namely    .  r-    ,  .,  ,',_,,. 

•w^ra  each  of  the  n  terras  on  the  right  of  (9)  is  representative  of  the  con- 
tribution that  the  ith  variable  (i  =»  1,  ..•,  n)  makes  to  the  total  deter- 
mination of  the  whole  system.  The  significance  of  this  additive  breakdown 
lies  in  the  fact  that  each  variable  may  be  ranked  according  to  its  injKDrtance 
in  the  systea. 
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To  8umiiiari2ie  the  «inal;^'sis  of  correlation,  consider  •fcahle  3  for  the  special 
case  of  a  syatem  of  correlation  coefficients  of  order  5«  One   can  observe  tliat 
there  is  one  colunm  of  the  table  for  each  variable  and  for  a  system  of  order 
less  than  8  this  is  a  concise  method  for  suiataarizlng  the  results.  For  higher 
order  systems,  the  conq)ater  progrcun  considei«d  later,  will  have  an  output 
format  vhich  vill  also  be  in  a  soine^rtiat  similar  tovm  for  easy  inspection. 

HJMBRICAL  EXAMPIE 

A  publication  which  appeared  in  1959  hy  Bsfwey  and  Lu  1«  an  excellent 
exanrple  of  the  application  of  the  previous  method.  This  publication  dealt 
with  crested  wheat  grass  seed  production  which  is  currently  of  aoich.  interest 
to  the  plant  breeder.  Data  were  collected  on  open-pollination  progenies  of 
79  standard  crested  wheatgrass  plants  at  the  Evans  farm,  which  is  part  of 
the  Utah  State  University  Agricultural  Experiment  Station,  located  near 
Logan,  Utah.  The  authors  of  the  above  paper  were  interested  in  an  investi- 
gation of  the  associations  between  the  characteristics  on  which  measure - 
asnts  were  taken  tor  the  pux^se  of  determining  a  selection  program  to  use 
in  order  to  increase  seed  production.  These  characteristics  were  seed  size, 
spikelets  per  spike,  fertility,  plant  sise,  and  seed  yield. 

Table  k  displays  the  results  in  which  the  following  points  mi^t  be 
brought  out  as  an  example  of  the  usefulness  of  an  analysis  of  correlation 
In  a  determinately  closed  symmetric  multivariate  sjTStea.  Upon  inspect i<m 
of  the  conflations  between  seed  yield  and  the  other  four  factors  which 
are  assumed  to  detemiine  it,  it  may  be  observed  tliat  from  the  conflation 
coefficients  alone,  one  would  suspect  that  spilcelets  per  spilce  would  be  the 
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TABIiE  h 


Seed  size 


SpliBlets  per  spike 


Fertility 


Plant  size 


Seed  yield 


Direct  Effect 


Pooled  indirect  Effect 


Indirect  Effect 
due  to  factor 


Determination  of  factor 
Determination  of  systeia 


Factor 


Seed   Spikelets  Plant 

Size   per  spike   Fertility   size 
X  2         3       4 
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most  iniportant  factor  in  detewiining  seed  :,deld.  This  would  probably  re- 
sult by  observing  that  .^^77  >  .W«D  >  .256  >  .005  which  are  the  correlation 
coefficients  between  seed  jleld  and  spikelets  per  plant,  plant  si»e,  fer- 
tility, and  seed  size  respectively.  However,  by  analyzing  the  vector  of 
correlations  (represented  by  the  fifth  row  in  table  h)   between  seed  yield 
end  its  detomdning  factors  for  the  direct  and  indirect  effects  of  each 
correlation,  one's  opinion  ;)OSt  assuredly  would  be  changed  as  to  the  im- 
portance of  spikelets  per  spike  in  determining  seed  yield  when  it  is  seen 
that  the  indirect  effect  of  the  otlier  factojw  is  the  effect  which  is  contri- 
buting to  the  magnitude  of  the  correlatioa  between  seed  yield  and  spikelets 
per  spike.  Therefore,  by  considering  the  direct  effects  of  each  factor,  It 
Is  most  apparent  that  fertility  and  plant  size  are  the  factors  to  be  noted 
In  determining  what  procedure  to  use  to  increase  seed  yield. 

By  the  use  of  an  analysis  of  correlation,  as  appeared  in  the  aforemen- 
tioned publication,  it  was  possible  to  suggest  those  factoxv  in  the  deter- 
minately  closed  system,  namely,  fertility  and  plant  size  which  the  plant 
breeder  should  use  in  his  selection  program. 

The  usefulness  of  the  method  is  suramarized  by  Dewey  and  Lu  (l959)  in 

the  following  rriannerj 

As  more  variables  are  considered  in  the  correlation  table,  these 
indirect  associations  become  raore  complex,  less  obvious,  and  somewhat 
perplexing.  At  this  point,  the  path-coefficient  analysis  provides  an 
effective  oeans  of  untangling  direct  and  indirect  causes  of  association 
and  permits  a  critical  exairdnation  of  the  specific  forces  acting  to 
produce  a  given  correlation  and  measures  the  relative  inqportance  of 
each  causal  factor. 


Ik 


>'  I'i',, 


COMPlJilEB  PSOGRAM 


Function 

The  objective  of  an  analysie  of  correlation  progiram,  fzx>m  a  mathematical 
point  df  riw,   is  to  find  the  solution  to  higher  order  correlation  eyatttos 
of  the  kind  :. 

vhere  A  is  nonslngular  and  positive  definite  by  finding  the  solution  vector 

p  -  a"-Si  ■' 

vriaich  involves  obtaining  the  inverse  to  the  correlation  loatrix  A.    Then,  by 
suitably  partitioning  R  and  A,  one  can  find  the  direct  and  indirect  coef- 
ficients that  laalce  up  each  correlation  in  the  vector  R.    Thus,  the  eq^uation 
l3«co:iies 

.     ':":■-■'/■:  A,p.R,  ■    . . 

for  all  i  =  Ij  2,  3>  -..,  n.  Then,  upon  obtaining  the  solution  vector,  P' 
is  also  deterralaed  so  that  it  is  nov  postlble  to  calculate  the  coefficient 
of  determination 

.-      ■;  -•      ■  -•=..     .•      ---^    i     ..  -      .  :     .     ,    - 
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'2a 


*21     ^22    •••  *' 


ft"     •  ft     -t       •  •  •      ft 

nl      n2  nn 


,  Trtiere 


<  1  if  ly.  j 
»  1  if  l»  J. 


«ad  by  augiaenting  A  on  the  right  by  the  identity  inatrix  I^^  , 


AI 


n 


,r-'V--:  -■■■    ■■ 


*ij 


^1     *12 

•  •  • 

*ln 

1 

0 

...     0 

*21     *22 

•  •  • 

*2n 

0 

1 

...     0 

«  '      «  . 

• 

• 

• 

• 

*          m 

« 

• 

• 

« 

»     .■•--■. 

• 

« 

• 

• 

>  V 

•   •   i 

V 

0 

0 

...   1 

leinentary 

row 

operations  of  t^-pe  II 

^/^i 

f> 

and  of  type  III  being 

^J  -  *kj  -  *ii*ij 
for  i,  i  "  I,    . . .,  n,  AI  can  be  reduced  to  the  form 


AI_ 


0 

i 


,..  0  b^  bj^ 


...  0  b23_  bgg 


In 


2n 


«  t 


1  ^nl  ^n2 


nn 


I  B 

n 


where 


How  by  taking  the  product 


B  -  A 


BR 
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which  is  the  se-T*  as.5,  b,,r.  for  all  ;)  •»  1,  .♦.,  n,  ti^  »olutic«i  to  the 

vector  P  can  be  obtained.  By  coiisidering  each  ith  row  separately,  one  can 


now  analyze  each  element  of  the  vector  R.  Thua 


Then  by  taking  the  product 


or 


A.P  =  R.  . 

X         i 


p«R  =  S 


the  coefficient  of  determination  S  is  obtained. 
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Flov  Chart 


Bead  In  order 
of  syftem 


Set  Ml  »  M  •(•  1 
M2  =  M  +  M 
Ml^  =  M  -  1 
M3  «  Mi^  -f  M 


Set  cui^.ulatlve 
constants  «  0 


Read  in  elements 
of  correlation 

matrix 
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Augment  matrix  on 
right  by  identity 
nusitrix 


Compute  matrix 
inverse  using 


=  x^ 


^i 


(o 


Partition  matrix  A 
and  compute 
A.P  »  R.  for  i  «  1,  ...,  n, 
Analysis  of  Correlation 


Coii5)ute 

determination 

of  system 


8 


flKop 


Description  of  Flow  Chart 


Box  1:   Tbe  order  M  of  tl^e  system  is  read  into  memory. 
Box  2:   The  limits  for  the  suraraations  to  be  made  later  are  set. 
Box  3:   A  pair  of  dummy  constants  sa%  used  for  the  initial  starting  point 
of  any  summation.  These  are  set  equal  to  zero. 
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Box  k:       The  eleroents  of  the  conrelation  matrix  are  reaA  and  put  into  memory. 

Box  5:   An  identity  niatrix  of  order  M  is  augmented  to  the  right  of  the  corre- 
lation matrix. 

Box  6:   Two  elenientary  matrix  operations,  namely,  type  II  and  type  III  oper- 
ations, are  used  to  reduce  the  correlation  matrix  A  to  the  identity 
matrix  vhile  at  the  same  time  the  same  operations  that  vere  performed 
on  A  are  performed  on  the  augmented  identity  matrix  to  finally  reduce 
it  to  a  matrix  A   vhich  is  the  inverse  of  A.  Then,  the  vector  P  is 
found  by  taking  the  product  A  k. 

Box  7:   Bach  correlation  R.  of  the  vector  R  is  now  found  for  i  ■  1,  ...,  n 

n 
by  taking  the  product X,  ^i^P*  ^^^  ^°^  9euch.  i  and  j  is  printed  out. 

Box  8t   The  detenaination  of  the  system  is  then  computed  by  taking-Zur^p.  »  8. 


FORTRAN  PROGRAM 


1  FORMAT(lM 

2  F0RMAT(F10.6) 

71  F0RMAT(2^  ANALYSIS  OF  CORREUTION) 

72  F0RMAT(28HC0EFFICIENT  OF  DETERMINATION) 

80  FORMAT(33H  DIRECT  CONTRIBUTION  COEFFICIENTS) 

81  FORMAT (//) 
83  FORMAT (/) 

100  F0RMAT(F18.6,I4,IM 
310  F0RMAT{F10.5J4,I4) 

DIMENSION  S(20,20),B(20),ANAL(20),BAL(20),R(20,20),T(20) 
C  INITIALIZE  AND  READ  IN  DATA 
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51  READ1,M 
MI-M-t-] 

MiWM-1 
M3-M4+M 
DET-0.0     ' 
ANAL(0)-0.0 
D05I-1.M 
D05J»t-M 
REA02,S(I.J) 
R(I,J)=S(I,J) 
5  CONTINUE 
AUGMENT  MATRIX  ON  RIGHT  BY   IDENTITY  MATRIX 
D08I«1,M 
11-l+M 
007J-M1,M2 

7  S(l,J)-0.0 

8  S(IJ1)-1.0 

COMPUTE  DIRECT  CONTRIBUTION  COEFFICIENTS  AND  INVERSE,TYPEOUT  COEFFS 
PRINT81 
PRINT80 
PRINT81 
D022I-1,M 

10  FAC»S(I,I) 

IF(I-M)17,n,17 

11  D013J-1,m4 

12  I1-J+M 
B(J)-S(J,M) 

13  PRINT310,B(J),J 
PRINT83 
0016J«^UM3 

14  I1.J-M 
0016K-I1,M4 

15  TEMP3-S(K,J) 
IF(SENSE  SWITCH  3)61,16 

61   PRINT100,TEMP3,K.  II  .    .. 

16  CONTINUE 
PRINT83 

17  D018J-1,M2 

18  S(l,J)oS(l,J)/FAC 
D022K«1,M 
IF(K-I)20,22,20 

20  FAC«S(K, I) 
0021J«I,M2 

21  S(K.J)=S(K,J)-FAC*S(I,J) 

22  CONTINUE 


80 


C  PUNCH  AND  PRINT  OUT  INVERSES 
00201 K»1.M 
D0200JaM1,M2 
IF(SENSE  SWITCH  3)200,201 

200  TYPE100,S(K,J),K,J 

201  CONTINUE 

C  PRINT  OUT  ANALYSIS  OF  CORREUTiON 
PRINT83 
PRINT71 
D0301I-1,M4 
BAL(l)-0.0 
D0300J-1,Mi» 
ANAL(J)-Jl(l,J)*B(J) 
BAL(l)«BAL(l)+ANAt(J) 

300  PRINT310,ANAL(J).I,J 
PRINT310,BAL(I) 
PRINT83 

301  CONTINUE 
PRINT72 
PRINT83 
DO^OJ-1  M 

400  DET»DET+R(M,J)*B(J) 
PRINT2,0ET 
PAUSE 
GO  TO  51 
END 


=  '■!  \ 


DIRECT  CONTRIBUTION  COEFFICIENTS 


.22973  1 

.31676  2 

1.12189  3 

.91530  k 


ANALYSIS  OF  CORRELATION 


.22973 

1 

1 

.08679 

1 

2 

-.79206 
.48053 

1 

3 

1 

4 

.00500 

.. 

,- 
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M2$k  2   1 

.31676  2   2 

-.33656  2   3 

.43385  2   4 
.47699 

-.16218  3   1 

-.09503  3   2 

1.12189  3 

-.60867  3 
* 25600 


i 


.12060  4       1 

.15014  4      2 

-.74606  4 

.91530  4 
.44000 


I 


COEFFICIENT  OF  DETERMINATION 
.842187 

■'•■'•■      ■   .     C 

Discussion  of  the  Prograoi 

The  fortran  program  presented  is  capable  of  handling  a  system  of  correla- 
tion coefficients  up  to  order  20;  thus,  the  limits  on  M  are  2  <  M  <  20.  The 
elements  of  the  correlation  matrix  A  are  read  in  by  punched  card  starting 
vlth  the  elcznent  in  the  first  rov  and  first  colwm  and  followed  by  the 
succeeding  elements  in  each  row  as  we  move  down  the  row  vectors  in  the  matrix. 
Sach  correlation  is  punched  oja  a  o^^urate  card  in  a  ten-digit  I'ield  along  vlth 
the  identification  of  its  srav  aoA  coXtBnn  position  in  the  matrix. 

By  ths  use  of  sense  switch  control,  it  is  possible  to  have  printed  out 
the  inverse  of  the  correlation  matrix  if  sense  switch  number  three  is  on. 
..   The  output  medium  is  done  by  typewriter  and  is  self-explanatory. 
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A  tnethod  has  been  developed  for  a  very  special  miiltivariate  system  where 
one  factor  can  be  considered  and  assiimed  to  be  determined  by  n  other  factors. 
The  correlations  are  assumed  to  be  known  or  available  but  if  not,  can  b« 
coaputed  frora  saarple  data.  The  method  can  be  very  useful  for  higher  oirder 
•ystesis  wliere  the  interaction  of  many  factors  liecoiaes  practically  iuqpossible 
to  separate  and  interpret. 

It  is  very  interesting  as  well  as  comforting  to  know  that  a  solution 
al«a(ys  exists  for  the  Bystcm  since  the  correlation  matrix  A  is  nonsingular 
and  positive  definite.  ThiiS,  when  one  seeks  the  practical  solution  to  a 
tymtein  AP  «  R,  the  solution  vector  P  is  obtainable  since  A   will  always 
exist  for  such  a  system. 

Like  any  laethod  which  is  developed,  it  is  much  easier  to  use  and  appre- 
ciate this  method  if  there  is  both  a  practical  and  usable  means  for  applying 
it.  The  computer  program  presented  furnishes  the  means  to  this  end,  and 
affo3rds,  in  addition,  an  available  way  for  verification  of  the  method.  The 
program  presented  allows  one  to  analyze  systems  of  correlation  coefficients 
\Q)  to  order  20.  With  slight  alteration,  systetas  up  to  order  50  could  also 
be  handled. 

It  is  quite  apparent  that  the  ultimate  usefulness  of  the  method  depends 
upon  the  ingenuity  of  the  user  to  apply  it.  The  deter.uination  of  the  effect 
factor  is  Mcuoted  to  be  imldlrectionaLly,  linearly,  and  completely  determined 
by  the  other  faetort  of  th*  sj^tcm  and  therefore,  careful  consideration  must 
be  used  in  applying  the  method.  As  Tukey  (l95'»-)  points  out,  the  original 
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vork  of  Wright  (I9l8>  192l)  vas  an  atteo^jt  to  go  from  a  puarely  descriptive 
loethod  of  analyzing  data  to  a  functional  level  treattoent  of  data.  However, 
Wright's  method  ia  considered  "by  Tukey  and  others  to  be  probably  at  a  level 
somevhere  between  that  irtiich  they  call  a  tangential  level.  The  method  pre- 
sented is  a  special  case  where  a  determinately  closed  system  is  considered 
so  that  the  assumption  of  being  completely  determined  ia  only  met  when  the 
deteiwlnation  of  tlie  system  is  S  =  IOO5&. 
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APPEin}IX 


% 


t9 


tfat  uzidsrlying  modal  of  fim  stultlvariat*  syvtem  has  tba  form  (Vright  1960a) 
^^lilllag  tb«  Arrlvtioa  from  th»  raMKUi  for  Mieh  texa  mA  Alvldine  each  t«ni 


by  #^iHBi  h«wt 


viiere 


•m      V    'n     ^        ^     V    «r    ^ 


fia'Vj,  (i  -  1,  2,   ...,  n,  r). 

V 

The  Mt  of  •quKtloas  for  the  eolutioa  to  (2*)  are 

'2lPlm  *  J'am  *  —  ♦  ^2n»«ua  "  '2m  ^3*> 


'nlPlm*'n2P2m*  •••♦»««"'. 


not 
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By  using  laatrix  notation,  ve  can  express  (3')  by 

AP  -  R 

■.-■^.  -,  ■  ,  ■  :    , 

Where 


^   ''12  •••  ^In 
^21  ^    •'•  ^2n 


♦     4 

>     • 


,  f 


l—        — 

1 —    — ' 

Plm 

,  R- 

^im 

P2m 

*'2m 

4 

« 

« 

• 

• 

• 

ma 

ikn 


'nl  'nfi  •••  ^ 

loting  that  by  multiplying  the  first  equation  in  (3')  by  p.  ,  the  second  by 
P/j_*  *•*»  tiie  nth  by  p  ,  one  obtains  the  set  of  equations 


""am 


"nm 


^im^'lm  'Pirn  *  ^12PlmP2m  *  '"  *  'in^laiP, 


nffl 


P2m^2m  "  ^ai^am^lm  *  ^Sm    *  ••'  *  ^gn^H^) 


nm 


(5') 


Hov  by  Bidding  the  n  equations  in  (5')*  one  has 

n  n  p        n      n 

;:       .        il  Pim^im  =i?L  Pim     *  ftl  A  ^im^jm^J'   ^  >  ^ 


but  realizing  that  p   ,  thu» 
n 


^  'ik  Pim^im  *  V    'i&.  Pii.     =  ?&.  j&.  PimViJ  *  ^qpi  '   ^  ^  ^' 


n   n 


(6') 


(?•) 


This  is  the  saiae  result  which  Wright  (l960a)  obtains.  But  in  matrix  notation 
what  amounts  to  the  sanie  thing  is  matrix  multiplication  on  the  l«ft  of  each 
side  of  equation  (4»)  by  F'. 


sn 


P'(AP)  =  P»R 
but  by  the  associative  lav  for  matrix  multiplication 

(P'A)P  =  P»R 
•0 

BP  »  S  (8') 

vhere 

B  •  P'A  and  S  =  P'R. 
But  ve  should  notice  that 

P'A  ■  (pt  +  p.  r„  +  . . .  +  p  r  ,,...,  p,  r,  +...-»•  p  J 

'Im  *'2m  21        '^im  nl'    '  'Im  In        *^nai 

Im  2m     nm* 
•  B 


nm 


P'R  »  p,  r,  -f  VnT-.    ♦  ...  4  p  r 

•*xm  Im   *^2m  2m        *^nra  c 

!Bi«tefore,  corresponding  to  «fLi»ti(»i  il*),  Vti  have 

1  »  S  +  (1  -  S)  -  8  +  Q.  (9») 
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Basically,  systems  of  correlation  coefficients  can  lie  handled  from  two 
points  of  view.  One  may  consider  the  purely  statistical  description  of 
such  aystems  or  one  may  consider  the  interpretation  of  caiisal  relations 
(Wright  195^).  The  latter  has  been  considered  in  this  thesis  for  the 
special  case  of  a  determinately  closed  symmetric  multivariate  system. 

Hhe  method  is  presented  In  matrix  form  for  ease  of  handling  and  a 
practical  solution  for  higher  order  systems  is  provided  "by  a  computer  pro- 
gram. The  matrix  equation 

AP-B 
is  associated  vith  a  multivariate  model  which  for  lack  of  a  better  means 
is  represented  by  a  diagram  similar  to  Wright 's  (193^^,  1960a) . 

A  determinately  closed  system  is  defined  as  well  as  necessary  terras 
such  as,  direct  and  indirect  effects,  for  the  purpose  of  presenting  an 
analysis  of  correlation.  The  analysis  is  presented  in  tabled  form  for 
clarity  and  convenience,  (tables  1  and  2) .  The  determination  of  the  system 
is  then  derived  and  it  is  found  that  by  partitioning  the  sum  of  the  terms 
for  the  expression  of  the  coefficient  of  determination  that  the  relative 
importance  of  each  factor  may  be  accounted  for.  The  entire  method  is  then 
■mull  1  I  n  by  the  use  of  table  3> 

A  numerical  exan5>le  is  presented  to  support  the  usefulness  of  such  a 
method,  namely,  the  publication  which  appeared  in  1959  hy  Dewey  and  Lu. 
From  this  publication,  it  is  quite  apparent  that  in  seeking  a  selection 
program  to  use  in  order  to  increase  seed  yield  In  the  crested  wheatgrass. 


one  cannot  depend  only  upon  the  inspection  of  correlation  coefficients  to 
give  an  adeqiiate  solution  to  the  problem.  However,  upon  utilization  of 
the  method  of  path  coefficients,  the  authors  were  able  to  suggest  a 
selection  program  to  follow.  It  happens  that  the  path  analysis  model 
chosen  by  Devey  and  I*i  is  coirtpletely  analogous  to  the  method  presented 
in  tMs  thesis. 

A  practical  means  for  the  application  of  the  method  for  higher  order 
systems  is  furnished  by  the  computer  program.  Then,  presented  in  detail, 
is  the  function  of  the  piograa,  a  mathematical  discussion,  as  well  ei«  * 
flow  chart  with  description  and  a  fortrau  program.  The  numerical  exai^ple 
is  then  used  as  a  sample  output  for  the  computer  program. 

Some  of  the  nmin  points,  such  as  the  basic  assumptions  underlying  the 
method  and  when  one  will  have  a  solution,  axe  then  smomarized. 


